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WEIGHTED FAIR QUEUE HAVING EXTENDED EFFECTIVE RANGE 



CROSS REFERENCE TO RELATED PATENT APPLICATIONS 

The present application is related to the 
following U.S. Patent Applications, each of which is hereby 
5 incorporated by reference herein in its entirety: 

U.S. Patent Application Serial No. , filed , 

titled "WEIGHTED FAIR QUEUE SERVING PLURAL OUTPUT PORTS" 
(IBM Docket No. ROC920010200US1) ; 

10 

U.S. Patent Application Serial No. , filed , 

u . titled "WEIGHTED FAIR QUEUE HAVING ADJUSTABLE SCALING 

O FACTOR" (IBM Docket No. ROC920010201US1) ; 

Jj£ 15 U.S. Patent Application Serial No. , filed , 

H titled "EMPTY INDICATORS FOR WEIGHTED FAIR QUEUES" (IBM 

Docket No. ROC920010202US1) ; 

s 

q U.S. Patent Application Serial No. , filed , 

J"; 20 titled "QoS SCHEDULER AND METHOD FOR IMPLEMENTING PEAK 
H SERVICE DISTANCE USING NEXT PEAK SERVICE TIME VIOLATED 

INDICATION" (IBM Docket No. ROC920010203US1) ; 

U.S. Patent Application Serial No. , filed , 

25 titled "QoS SCHEDULER AND METHOD FOR IMPLEMENTING QUALITY OF 
SERVICE WITH AGING STAMPS" (IBM Docket No. ROC920010204US1) ; 

U.S. Patent Application Serial No. , filed , 

titled "QoS SCHEDULER AND METHOD FOR IMPLEMENTING QUALITY OF 
30 SERVICE WITH CACHED STATUS ARRAY" (IBM Docket No. 
ROC920010205US1) ; and 
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U.S. Patent Application Serial No. , filed , 

titled "QoS SCHEDULER AND METHOD FOR IMPLEMENTING QUALITY OF 
SERVICE ANTICIPATING THE END OF A CHAIN OF FLOWS" (IBM 
Docket No. ROC920010206US1) . 

5 

FIELD OF THE INVENTION 

The present invention is concerned with storage 
and data communication systems and is more particularly 
concerned with a scheduler component of a network processor. 

10 

BACKGROUND OF THE INVENTION 

Data and storage communication networks are in 

y* widespread use. In many data and storage communication 

f*\ 

networks, data packet switching is employed to route data 

H 15 packets or frames from point to point between source and 

fp 

yi destination, and network processors are employed to handle 

^ transmission of data into and out of data switches. 

a FIG. 1 is a block diagram illustration of a 

C conventional network processor in which the present 

Q 2 0 invention may be applied. The network processor, which is 

p generally indicated by reference numeral 10, may be 

S i. 

constituted by a number of components mounted on a card or 
"blade" . Within a data communication network, a 
considerable number of blades containing network processors 

2 5 may be interposed between a data switch and a data network. 

The network processor 10 includes data flow chips 
12 and 14. The first data flow chip 12 is connected to a 
data switch 15 (shown in phantom) via first switch ports 16, 
and is connected to a data network 17 (shown in phantom) via 

30 first network ports 18. The first data flow chip 12 is 
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positioned on the ingress side of the switch 15 and handles 
data frames that are inbound to the switch 15. 

The second data flow chip 14 is connected to the 
switch 15 via second switch ports 20 and is connected to the 
5 data network 17 via second network ports 22 . The second 
data flow chip 14 is positioned on the egress side of the 
switch 15 and handles data frames that are outbound from the 
switch 15. 

As shown in FIG. 1, a first data buffer 24 is 
10 coupled to the first data flow chip 12. The first data 

buffer 24 stores inbound data frames pending transmission of 
the inbound data frames to the switch 15. A second data 
^ buffer 26 is coupled to the second data flow chip 14, and 

Q stores outbound data frames pending transmission of the 

5 ! 

0i 15 outbound data frames to the data network 17. 
W* The network processor 10 also includes a first 

?g processor chip 28 coupled to the first data flow chip 12. 

The first processor chip 28 supervises operation of the 

H first data flow chip 12 and may include multiple processors. 

O 

u 20 A second processor chip 30 is coupled to the second data 

"n 

flow chip 14, supervises operation of the second data flow 
chip 14 and may include multiple processors. 

A control signal path 32 couples an output 
terminal of second data flow chip 14 to an input terminal of 
25 first data flow chip 12 (e.g., to allow transmission of data 
frames therebetween) . 

The network processor 10 further includes a first 
scheduler chip 34 coupled to the first data flow chip 12. 
The first scheduler chip 34 manages the sequence in which 
3 0 inbound data frames are transmitted to the switch 15 via 
first switch ports 16. A first memory 36 such as a fast 
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SRAM is coupled to the first scheduler chip 34 (e.g., for 
storing data frame pointers and flow control information as 
described further below) . The first memory 36 may be, for 
example, a QDR (quad data rate) SRAM. 
5 A second scheduler chip 3 8 is coupled to the 

second data flow chip 14. The second scheduler chip 38 
manages the sequence in which data frames are output from 
the second network ports 22 of the second data flow chip 14. 
Coupled to the second scheduler chip 3 8 are at least one and 
10 possibly two memories (e.g., fast SRAMs 40) for storing data 
frame pointers and flow control information. The memories 
40 may, like the first memory 36, be QDRs. The additional 

lL memory 40 on the egress side of the network processor 10 may 

be needed because of a larger number of flows output through 

hi 15 the second network ports 22 than through the first switch 

jt! ports 16. 

H FIG. 2 schematically illustrates conventional 

queuing arrangements that may be provided for a data flow 
H chip/scheduler pair (either the first data flow chip 12 and 

r| 2 0 the first scheduler chip 34 or the second data flow chip 14 

Li 

^ and the second scheduler chip 38) of the network processor 

M 10 of FIG. 1. In the particular example illustrated in FIG. 

2, the first data flow chip 12 and the first scheduler chip 
34 are illustrated, but a very similar queuing arrangement 

25 may be provided in connection with the second data flow chip 
14 and the second scheduler chip 38. In the queuing 
arrangement for the first data flow chip 12 and the first 
scheduler chip 34, incoming data frames (from data network 
17) are buffered in the input data buffer 24 associated with 

30 the first data flow chip 12 (FIG. 1) . Each data frame is 
associated with a data flow or w f low" . As is familiar to 
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those who are skilled in the art, a *flow" represents a one- 
way connection between a source and a destination. 

Flows with which the incoming data frames are 
associated are enqueued in a scheduling queue 42 maintained 
5 in the first scheduler chip 34. The scheduling queue 42 
defines a sequence in which the flows enqueued therein are 
to be serviced. The particular scheduling queue 42 of 
interest in connection with the present invention is a 
weighted fair queue which arbitrates among flows entitled to 
10 a "best effort" or "'available bandwidth" Quality of Service 
(QoS) . 

As shown in FIG . 2, the scheduling queue 42 is 
□ associated with a respective output port 44 of the first 

data flow chip 12 . It is to be understood that the output 

01 15 port 44 is one of the first switch ports 16 illustrated in 

111 

i£ FIG. 1. (However, if the data flow chip/scheduler pair 

Sl under discussion were the egress side data flow chip 14 and 

jy, scheduler chip 38, then the output port 44 would be one of 

III the network ports 22.) Although only one scheduling queue 

H 2 0 42 and one corresponding output port 44 are shown, it should 

Q 

12 be understood that in fact there may be plural output ports 

and corresponding scheduling queues each assigned to a 
respective port. (However, according to an alternative 
embodiment, disclosed in co-pending patent application 

2 5 Serial No. , filed (Attorney Docket No. 

ROC920010200US1) , a group of output ports may be associated 
with each scheduling queue 42. This co-pending patent 
application is incorporated herein by reference.) 

Although not indicated in FIG. 2, the first 

30 scheduler chip 34 also includes flow scheduling calendars 
which define output schedules for flows which are entitled 
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to a scheduled QoS with guaranteed bandwidth, thus enjoying 
higher priority than the flows governed by the scheduling 
queue 42 . 

The memory 36 associated with the first scheduler 
5 chip 34 holds pointers ("frame pointers") to locations in 
the first data buffer 24 corresponding to data frames 
associated with the flows enqueued in the scheduling queue 
42. The memory 36 also stores flow control information, 
such as information indicative of the QoS to which flows are 
10 entitled. 

When the scheduling queue 42 indicates that a 
particular flow enqueued therein is the next to be serviced, 
reference is made to the frame pointer in the memory 36 
W corresponding to the first pending data frame for the flow 

M 15 in question and the corresponding frame data is transferred 
j;; from the first data buffer 24 to an output queue 46 

H associated with the output port 44 . 

ff* 

3 " A more detailed representation of the scheduling 

S 3 

queue 42 is shown in FIG. 3. As noted above, the scheduling 
Q 2 0 queue 42 is used for weighted fair queuing of flows serviced 

on a "best effort " basis. In a particular example of a 
H scheduling queue as illustrated in FIG. 3, the scheduling 

queue 42 has 512 slots (each slot represented by reference 

numeral 48) . Other numbers of slots may be employed. In 

2 5 accordance with conventional practice, flows are enqueued or 

attached to the scheduling queue 42 based on a formula that 
takes into account both a length of a data frame associated 
with a flow to be enqueued and a weight which corresponds to 
a QoS to which the flow is entitled. 

3 0 More specifically, the queue slot in which a flow 

is placed upon enqueuing is calculated according to the 
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formula CP + ( (WF x FS)/SF), where CP is a pointer ("current 
pointer 77 ) that indicates a current position (the slot 
currently being serviced) in the scheduling queue 42; WF is 
a weighting factor associated with the flow to be enqueued, 
5 the weighting factor having been determined on the basis of 
the QoS to which the flow is entitled; FS is the size of the 
current frame associated with the flow to be enqueued; and 
SF is a scaling factor chosen to scale the product (WF x FS) 
so that the resulting quotient falls within the range 
10 defined by the scheduling queue 42. (In accordance with 

conventional practice, the scaling factor SF is conveniently 
defined as a integral power of 2 - i.e., SF = 2 n , with n 
m being a positive integer -- so that scaling the product (WF 

pi x FS) is performed by right shifting.) With this known 

f zh 15 weighted fair queuing technique, the weighting factors 

£11 

|f| assigned to the various flows in accordance with the QoS 

assigned to each flow govern how close to the current 
* pointer of the queue each flow is enqueued. In addition, 

L flows which exhibit larger frame sizes are enqueued farther 

W 2 0 from the current pointer of the queue, to prevent such flows 
p from appropriating an undue proportion of the available 

bandwidth of the queue. Upon enqueuement, data that 
identifies a flow (the "Flow ID") is stored in the 
appropriate queue slot 48. 
25 In some applications, there may be a wide range of 

data frame sizes associated with the flows, perhaps on the 
order of about 64 bytes to 64KB, or three orders of 
magnitude. It may also be desirable to assign a large range 
of weighting factors to the flows so that bandwidth can be 
30 sold with a great deal of flexibility and precision. In 

such instances, it is desirable that the scheduling queue in 
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which weighted fair queuing is applied have a large range, 
where the range of the scheduling queue is defined to be the 
maximum distance that flow may be placed from the current 
pointer. As is understood by those who are skilled in the 
art, the scheduling queue 42 functions as a ring, with the 
last queue slot (number 511 in the present example) wrapping 
around to be adjacent to the first queue slot (number 0) . 

It could be contemplated to increase the range of 
the scheduling queue 42 by increasing the number of slots. 
However, this has disadvantages in terms of increased chip 
area, greater manufacturing cost and power consumption, and 
increased queue searching time. Alternatively, the 
resolution of the scheduling queue 42 could be decreased to 
increase the range, where "resolution" is understood to mean 
the inverse of the distance increment that corresponds to 
each slot in the queue. However, if resolution is 
decreased, flows that should be assigned different 
priorities according to their respective QoS may appear to 
"tie" by being assigned to the same slot, thereby improperly 
being assigned essentially equal service priority. Thus 
conventional practice in operating a weighted fair queue of 
the type described herein requires a tradeoff between 
resolution and range. In the particular example of a 
weighted fair queue as shown herein, having 512 slots, a 
distance increment of one distance unit may be assigned to 
each slot, in which case the queue has a range of 512. It 
would be desirable to increase the effective range of such a 
scheduling queue, without increasing the number of slots or 
decreasing the effective resolution. 
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SUMMARY OF THE INVENTION 

According to a first aspect of the invention, a 
scheduler for a network processor is provided. The 
scheduler includes one or more scheduling queues each for 
5 defining a sequence in which flows are to be serviced. At 
least one scheduling queue includes at least a first 
subqueue and a second subqueue. The first subqueue has a 
first range and a first resolution, and the second subqueue 
has a second range that is greater than the first range and 
10 a second resolution that is less than the first resolution. 

The first and second subqueues may, but need not, 
have equal numbers of slots. In one embodiment, the range 
H of the second subqueue may be sixteen times the range of the 

f*! first subqueue, and the resolution of the second subqueue 

15 may be one- sixteenth of the resolution of the first 
111 subqueue. Other relationships between range and/or 

" m resolution may be employed. For example, the range of the 

s second subqueue may be larger than the range of the first 

subqueue by any amount, and the resolution of the second 
H 2 0 subqueue may be less than the resolution of the first 

was 

O subqueue by any amount (e.g., regardless of the amount by 

SIS! b 

which the range of the second subqueue exceeds the range of 
the first subqueue) . However, in embodiments of the 
invention wherein the first and the second subqueues have 

25 the same number of slots, maintaining a direct inverse 

relationship between the resolution and the range of the 
second subqueue allows for an "effective" increase in 
scheduling queue range without an accompanying increase in 
consumed chip area. 

3 0 According to another aspect of the invention, a 

scheduler for a network processor is provided. The 
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scheduler includes a scheduling queue in which flows are 
enqueued according to the formula CP + ( (WF x FS) /SF) . In 
this formula CP is a pointer for indicating a current 
position in the scheduling queue, WF is a weighting factor 
5 associated with a flow appointed for enqueuing, FS is a 

frame size associated with the flow appointed for enqueuing, 
and SF is a scaling factor. The scheduling queue includes 
at least a first subqueue and a second subqueue . The flow 
appointed for enqueuing is enqueued to the first subqueue if 
10 the value of the expression ( (WF x FS)/SF) is less than a 
range of the first subqueue. The flow appointed for 
enqueuing is enqueued to the second subqueue if the value of 

^ the expression ( (WF x FS)/SF) is greater than a range of the 

p first subqueue. 

? n 15 Still another aspect of the invention provides for 

tr! a method of dequeuing a flow from a scheduling queue in a 

fn scheduler for a network processor. The method includes 

7 searching a first subqueue of the scheduling queue to find a 

M first winning flow in the first subqueue, and determining a 

jy 20 first queue distance corresponding to a distance between a 
U current pointer and a slot in which the first winning flow 

is enqueued. The method further includes searching a second 
subqueue of the scheduling queue to find a second winning 
flow in the second subqueue, and determining a second queue 
25 distance corresponding to a distance between the current 
pointer and a slot in which the second winning flow is 
enqueued. The method further includes comparing the first 
and second queue distances, and selecting for dequeuing one 
of the first and second winning flows based on a result of 
3 0 the comparing step. 
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By configuring a scheduling queue as two or more 
subqueues having different ranges and resolutions, the 
present invention allows the scheduling queue to offer an 
enhanced range, without adding to the total number of queue 
5 slots, while providing much of the benefit of high 
resolution. 

Other objects, features and advantages of the 
present invention will become more fully apparent from the 
following detailed description of exemplary embodiments, the 
10 appended claims and the accompanying drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 is a block diagram of a conventional 
network processor in which the present invention may be 
15 applied; 

FIG. 2 is a block diagram representation of 
conventional queuing arrangements provided in a data flow 
chip/scheduler pair included in the network processor of 
U FIG. 1; 

rf 2 0 FIG. 3 is a pictorial representation of a weighted 

p fair queuing scheduling queue provided in accordance with 

conventional practices; 

FIG. 4 is a pictorial representation of a 
scheduling queue including subqueues provided in accordance 
25 with the invention; 

FIG. 5 is a diagram that illustrates sharing of a 
current pointer by the subqueues of the present invention; 

FIG. 6 is a flow chart that illustrates a process 
for attaching a flow to the inventive scheduling queue of 
30 FIG. 4; 
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FIG. 7 is a flow chart that illustrates a process 
for detaching a flow from the inventive scheduling queue of 
FIG. 4; and 

FIGS. 8A-8C pictorially represent a higher 
5 resolution subqueue "sliding" along a lower resolution 
subqueue as a current pointer advances. 

DETAILED DESCRIPTION 

Exemplary embodiments of the invention will now be 
10 described with reference to FIGS. 4-8C. 

FIG. 4 pictorially illustrates a scheduling queue 
5 0 that may be implemented in accordance with the invention 
lJL in the first scheduler 34 and/or the second scheduler 3 8 of 

U the network processor 10 of FIG. 1. As illustrated in FIG. 

|,i 15 4, the inventive scheduling queue 50 includes a plurality of 

cn 

g" n subqueues having different respective ranges and 

H resolutions. In the particular example illustrated, the 

HI 

scheduling queue 50 includes a higher resolution subqueue 52 
(also indicated as subqueue 0) and a lower resolution 
Q 20 subqueue 54 (also indicated as subqueue 1) . Because both 
pi subqueues 52 and 54 have, in the examples shown, the same 

H number of slots (e.g., 256 slots) the lower resolution 

subqueue 54 has a larger range than the higher resolution 
subqueue 52. In the particular example of FIG. 4, the lower 

2 5 resolution subqueue 54 has sixteen times the range and one- 

sixteenth of the resolution of the higher resolution 
subqueue 52. For example, each slot in the higher 
resolution subqueue 52 may correspond to one distance unit 
for enqueuing and dequeuing purposes, whereas each slot in 

3 0 the lower resolution subqueue 54 may correspond to sixteen 

distance units for enqueuing and dequeuing purposes. 
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Consequently, in the examples shown, the higher resolution 
subqueue 52 has a range of 256 and the lower resolution 
subqueue 54 has a range of 4096. 

It will be noted that the total number of slots of 
the two subqueues 52 and 54 is 512, which is the same as the 
number of slots of the prior art scheduling queue 42 shown 
in FIG. 3. Consequently, the inventive scheduling queue 50 
of FIG. 4 may be implemented using substantially the same 
resources in terms of chip area, manufacturing cost and 
power consumption as the prior art scheduling queue 42 of 
FIG. 3. Nevertheless, the inventive scheduling queue 50, as 
will be seen, provides a greatly increased effective range 
allowing for accurate enqueuing of low-priority flows, while 
also providing most of the benefits of the high resolution 
of prior art scheduling queue 42 of FIG. 3 in regard to 
accurate enqueuing of high-priority flows. Other numbers of 
slots also may be employed. 

Although the scheduling queue 50 is shown to have 
only two subqueues in the example of FIG. 4, it is 
contemplated to include three, four or more subqueues in the 
inventive scheduling queue, with each subqueue having a 
different range and resolution from the others. For 
example, a third subqueue could be added having a range 256 
times the range of the highest resolution subqueue. In 
addition, a fourth subqueue could be added having a range 
that is 4096 times the range of the highest resolution 
subqueue . 

The present invention also contemplates that a 
single pointer may be shared by two or more subqueues to 
indicate the current positions in the subqueues. In the 
example illustrated in FIG. 4, with two subqueues of 2 56 
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slots apiece, and the lower resolution subqueue 54 having 
sixteen times the range of the higher resolution subqueue 
52, a 12 bit pointer may be shared. Since 8 bits are 
required to define the 256 slot positions in each subqueue, 
the 8 least significant bits may be applied to the higher 
resolution subqueue 52 (subqueue 0) , and the 8 most 
significant bits may be applied to the lower resolution 
subqueue 54 (subqueue 1) . Thus the middle four bits are 
shared, as illustrated by the exemplary current pointer bits 
in FIG. 5. 

FIG. 6 is a flow chart that illustrates a process 
by which flows are attached to the inventive scheduling 
queue 50. As indicated at block 60 in FIG. 6, the process 
of FIG. 6 idles until it is time to attach (enqueue) a flow 
to the scheduling queue 50. Then, at step 62, a calculation 
is performed to determine the distance from the current 
pointer at which the flow is to be attached. The formula 
employed to calculate this distance is the previously- 
mentioned formula ( (WF x FS)/SF). That is, as noted above, 
the enqueuement distance is a product of a weighting factor 
WF applied to the flow based on its QoS and the size of the 
current frame FS, divided by a scaling factor SF. 

It is then determined, at block 64, whether the 
resulting value is less than the range of the higher 
resolution subqueue 52. If that is the case, then, as 
indicated by block 66, the flow is attached to the higher 
resolution subqueue 52 at the indicated slot (i.e., at the 
indicated distance from the slot number indicated by the 8 
least significant bits of the 12 -bit current pointer) . 

However, if at block 64 it is found that the 
enqueuement distance is greater than or equal to the range 

14 
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of the higher resolution subqueue 52, then block 67 follows 
block 64. At block 67 the enqueuement distance is divided 
by 16 (e.g., the enqueuement distance is "scaled") to 
reflect the decreased resolution of the lower resolution 
5 subqueue 54. Next, at block 68, the flow is attached to the 
lower resolution subqueue 54 at a slot represented by the 
sum (e.g., using modulo 256 in an embodiment in which the 
lower resolution subqueue 54 employs 256 slots) of the 
scaled enqueuement distance plus the value indicated by the 
10 8 most significant bits of the 12-bit current pointer, plus 
1. The addition of 1 to the sum of the scaled enqueuement 
distance and the current pointer value is performed to take 

q into account a preference that, as will be seen, is accorded 

to the lower resolution subqueue 54 during dequeuement in 

Cfl 15 case of a tie with the higher resolution subqueue 52. 

H FIG. 7 is a flow chart that represents a process 

for dequeuing (detaching or servicing) flows from the 

M inventive scheduling queue 50. As indicated by block 70 in 

FIG. 7, the process of FIG. 7 idles until it is time to 

jU 20 detach from the queue the flow which is closest to the 

a 

l.l current pointer (head) of the queue. When it is time to 

find and service the winning flow, block 72 follows. At 
block 72, both the higher resolution subqueue 52 and the 
lower resolution subqueue 54 are simultaneously searched for 

25 the closest flow to the current pointer of the queue. In 
one embodiment, the head of the queue for the higher 
resolution subqueue 52 is defined by the eight least 
significant bits of the current pointer (FIG. 5) , and the 
head of the queue for the lower resolution subqueue 54 is 

3 0 defined by the eight most significant bits of the current 
pointer (FIG. 5) . The distance from the head of the queue 
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to each subqueue's winning flow is determined in parallel. 
The two distances are normalized (that is, for the lower 
resolution subqueue 54 a hardwired left shift of 4 bits is 
applied, which is equivalent to multiplying the distance by 
5 16) . It is then determined whether the two distances result 
in a tie (block 74) . If there is a tie between the two 
subqueues, then the closest flow from the lower resolution 
subqueue 54 is selected for detachment (block 76) . The 
lower resolution subqueue 54 is allowed to win in case of a 
10 tie to prevent it from being "starved" by the higher 

resolution subqueue 52. That is, the lower resolution 
subqueue 54 could be pre-empted indefinitely by the higher 
fa resolution subqueue 52. 

III If there is no tie, then block 78 follows block 

H 15 74. At block 78 the winning flow is detached and serviced, 
jfll (It will be understood that the "winning flow" is the flow 

H that is closest to the head of the queue.) 

7 Following block 76 or 78, as the case may be, is 

f* block 80, at which the position of the current pointer is 

O 2 0 updated to the position of the slot from which the winning 
ipI flow was detached. (It should be noted that the process 

H described in connection with FIG. 7 assumes that the 

scheduling queue in question is not empty. The process 

further assumes that bandwidth is available, i.e., that 
25 there is no higher priority or scheduled service that 

preempts servicing of a winning flow from the scheduling 

queue . ) 

The processes of FIGS. 6 and 7 are suitable for 
use where a scheduling queue consists of two subqueues. 
3 0 However, those of ordinary skill in the art will readily 

recognize how the processes of FIGS. 6 and 7 can be extended 
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to management of a scheduling queue having more than two 
subqueues. The processes of FIGS. 6 and 7 may be 
implemented in hardware, software or a combination thereof. 
In at least one embodiment of the invention, the processes 
5 of FIGS. 6 and 7 are implemented in hardware employing a 

suitable combination of conventional logic circuitry such as 
adders, comparators, selectors, etc. Such hardware, for 
example, may be located within the scheduler 34 and/or the 
scheduler 38 (FIG. 2) . A person of ordinary skill in the 
10 art may develop logic circuitry capable of performing the 
inventive processes described with reference to FIGS. 6 and 
7. In a software embodiment of the invention, the processes 
~1 of FIGS. 6 and 7 may comprise one or more computer program 

O products. Each inventive computer program product may be 

fj\ 15 carried by a medium readable by a computer (e.g., a carrier 

wave signal, a floppy disk, a hard drive, a random access 
|Q memory, etc . ) . 

L FIGS. 8A-8C together schematically illustrate how 

the inventive scheduling queue 50 combines advantages of 
m 2 0 wide range with, in effect, high resolution. Lower 
H resolution subqueue 54, with each of its 256 slots 

corresponding to sixteen distance units, has a total range 
of 4096. As the current pointer advances, the higher 
resolution subqueue 52 in effect ''slides' 7 along the lower 
25 resolution subqueue 54, providing a high resolution of one 
distance unit per slot over a subrange of 256 distance 
units. That is, in FIG. 8A, higher resolution subqueue 52 
overlaps with slots 80-95 of lower resolution subqueue 54; 
in FIG. 8B, higher resolution subqueue 52 overlaps with 
3 0 slots 96-111 of lower resolution subqueue 54; and in FIG. 
8C, higher resolution subqueue 52 overlaps with slots 112- 
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127 of lower resolution subqueue 54. Thus for flows to be 
enqueued relatively close to the current pointer, high 
resolution is provided by the higher resolution subqueue 52. 
At the same time, the wide possible range of frame sizes and 
weighting factors are accommodated by the extended range of 
the lower resolution subqueue 54. It will be noted that the 
inventive scheduling queue 50 has the same number of 
physical slots, namely 512, as the prior art scheduling 
queue 42 of FIG. 3, yet a much larger range is provided in 
the inventive scheduling queue 50, while preserving much of 
the benefit of high resolution via the higher resolution 
subqueue 52. Other numbers of slots may be employed. 

The invention also reduces the search time 
required to examine the scheduling queue for entries, and is 
scalable to larger designs. Both subqueues can be searched 
at the same time since they are physically separate and are 
controlled through the same current pointer. Accordingly, 
adding more subqueues will lead to better performance when 
searching for entries in the scheduling queue because there 
is more searching in parallel. 

The foregoing description discloses only exemplary 
embodiments of the invention; modifications of the above 
disclosed apparatus and methods which fall within the scope 
of the invention will be readily apparent to those of 
ordinary skill in the art. For example, although in the 
above description the ratio of the respective ranges and 
resolutions of the subqueues was selected to be 16, other 
ratios may be selected, including 2, 4, 8, 32 or 64. It 
also is not necessary that the ratio be a power of 2. That 
is, no specific relationship between subqueue resolution and 
range is required, and a resolution of a subqueue may be 
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decreased independently of an amount by which a range of the 
subqueue is increased. As noted before, more than two 
subqueues having different ranges and resolutions may be 
included in the inventive scheduling queue. 

Moreover, in the above description, the inventive 
scheduling queue has been implemented in a separate 
scheduler chip associated with a network processor. 
However, it is also contemplated to implement the inventive 
scheduling queue in a scheduler circuit that is implemented 
as part of a data flow chip or as part of a processor chip 
in a network processor. 

Accordingly, while the present invention has been 
disclosed in connection with exemplary embodiments thereof, 
it should be understood that other embodiments may fall 
within the spirit and scope of the invention as defined by 
the following claims. 
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